The Effects of Factorizing Root and Pattern Mapping in Translating between Tunisian Arabic and Standard Arabic
نویسندگان
چکیده
The development of natural language processing tools for dialects faces the severe problem of lack of resources. In cases of diglossia, as in Arabic, one variant, Modern Standard Arabic (MSA), has many resources that can be used to build natural language processing tools. Whereas other variants, Arabic dialects, are resource poor. Taking advantage of the closeness of MSA and its dialects, one way to solve the problem of limited resources, consists in performing a translation of the dialect into MSA in order to use the tools developed for MSA. We describe in this paper an architecture for such a translation and we evaluate it on Tunisian Arabic verbs. Our approach relies on modeling the translation process over the deep morphological representations of roots and patterns, commonly used to model Semitic morphology. We compare different techniques for how to perform the cross-lingual mapping. Our evaluation demonstrates that the use of a decent coverage root+pattern lexicon of Tunisian and MSA with a backoff that assumes independence of mapping roots and patterns is optimal in reducing overall ambiguity and increasing recall.
منابع مشابه
‘Repetition’ in Arabic-English Translation: The case of Adrift on the Nile
Abstract This study investigates ‘repetition’ in the English translation of the Arabic Novel, Adrift on the Nile (1993). It aims to explore the communicative functions of ‘repetition’ and to see if these functions have been maintained or lost in the process of translating the Novel. In addition, it seeks to find the translation strategies used in rendering ‘repetition’. To achieve this aim, a d...
متن کامل‘Repetition’ in Arabic-English Translation: The case of Adrift on the Nile
Abstract This study investigates ‘repetition’ in the English translation of the Arabic Novel, Adrift on the Nile (1993). It aims to explore the communicative functions of ‘repetition’ and to see if these functions have been maintained or lost in the process of translating the Novel. In addition, it seeks to find the translation strategies used in rendering ‘repetition’. To achieve this aim, a d...
متن کاملMorphological structure in the Arabic mental lexicon: Parallels between standard and dialectal Arabic
The Arabic language is acquired by its native speakers both as a regional spoken Arabic dialect, acquired in early childhood as a first language, and as the more formal variety known as Modern Standard Arabic (MSA), typically acquired later in childhood. These varieties of Arabic show a range of linguistic similarities and differences. Since previous psycholinguistic research in Arabic has prim...
متن کاملGeo-cultural pattern of Islamic Republic of Iran regarding to Arabic uprising in Middle East (2014-2011)
Since 2011, the region has been a profound socio-economic changes originated from n Tunisia, & spread to Middle Eastern and the former power structures affected. The management and direction to these uprisings is the key question of this paper. The key question, is the foreign policy of the Islamic Republic of Iran how answer to these Middle East public uprisings in the years 2011-2014? <...
متن کاملRevisiting the Arabic Diglossic Situation and Highlighting the Socio-Cultural Factors Shaping Language Use in Light of Auer’s (2005) Model
In the field of Arabic sociolinguistics, diglossia has been an interesting linguistic inquiry since it was first discussed by Ferguson in 1959. Since then, diglossia has been discussed, expanded, and revisited by Badawi (1973), Hudson (2002), and Albirini (2016) among others. While the discussion of the Arabic diglossic situation highlights the existence of two separate codes (High and Lo...
متن کامل